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, Abstract 

o . 

, The capacity of peak-power limited, single-antenna, noncoherent, flat-fading channels 

with memory is considered. The emphasis is on the capacity pre- log, i.e., on the limiting 
ratio of channel capacity to the logarithm of the signal-to- noise ratio (SNR), as the SNR 
tends to infinity. It is shown that, among all stationary & ergodic fading processes of a 
given spectral distribution function and whose law has no mass point at zero, the Gaussian 
process gives rise to the smallest pre-log. The assumption that the law of the fading process 
has no mass point at zero is essential in the sense that there exist stationary & ergodic 
fading processes whose law has a mass point at zero and that give rise to a smaller pre- 
log than the Gaussian process of equal spectral distribution function. An extension of our 
results to multiple-input single-output fading channels with memory is also presented. | 
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1 Introduction 



We study the capacity of peak-power limited, single-antenna, discrete-time, flat-fading channels 
with memory. A noncoherent channel model is considered where the transmitter and receiver 
r**"* . are both aware of the law of the fading process, but not of its realization. Our focus is on the 

capacity at high signal-to- noise ratio (SNR). Specifically, we study the capacity pre-log, which is 
defined as the limiting ratio of channel capacity to the logarithm of the SNR, as the SNR tends 
' to infinity. 

The capacity pre-log of Gaussian fading channels was derived in [T] (see also [2]). It was 
shown that the pre-log is given by the Lebesgue measure of the set of harmonics where the 
derivative of the spectral distribution function that characterizes the memory of the fading 
process is zero. To the best of our knowledge, the capacity pre-log of non-Gaussian fading 
channels is unknown. 

In this work, we demonstrate that the Gaussian assumption in the analysis of fading channels 
at high SNR is conservative in the sense that for a large class of fading processes the Gaussian 
process is the worst. More precisely, we show that among all stationary & ergodic fading 
processes of a given spectral distribution function and whose law has no mass point at zero, the 
Gaussian process gives rise to the smallest pre-log. 

This paper is organized as follows. Section [5] describes the channel model. Section [3] defines 
channel capacity and the capacity pre-log. Section [4] presents our main results. Section [5] 
provides the proofs of these results. Section [5] discusses the extension of our results to multiple- 
input single-output (MISO) fading channels with memory. Section [7] concludes the paper with 
a summary and a discussion of our results. 

2 Channel Model 

Let C and Z denote the set of complex numbers and the set of integers. We consider a single- 
antenna flat-fading channel with memory where the time-fc channel output 5^ £ C corresponding 
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to the time-A: channel input x k £ C is given by 

Y k = H k x k + Z k , fc e Z. (1) 

Here the random processes {Z k , k £ Z} and {H k , k £ Z} take value in C and model the addi- 
tive and multiplicative noises, respectively. It is assumed that these processes are statistically 
independent and of a joint law that does not depend on the input sequence {x k }. 

The additive noise {Z k , k £ Z} is a sequence of independent and identically distributed 
(IID) zero- mean, variance-er 2 , circularly-symmetric, complex Gaussian random variables. The 
multiplicative noise ("fading") {H k , k £ Z} is a mean-d, unit-variance, stationary & ergodic 
stochastic process of spectral distribution function F(X), —1/2 < A < 1/2, i.e., F(-) is a bounded 
and nondecreasing function on [—1/2, 1/2] satisfying 

,1/2 

E[(H k+m -d){H k -d)*]= e l2 ™ A dF(A), (fc e Z, meZ), (2) 

J-l/2 

where i = \/— T, and where A* denotes the complex conjugate of A [3l p. 474, Thm. 3.2]. Since 
F(-) is monotonic, it is almost everywhere differentiable, and we denote its derivative by F'(-). 
(At the discontinuity points of F(-) the derivative F'(-) is undefined.) For example, if the fading 
process {H kl k £ Z} is IID, then 

F'(A) = 1, ~\<X<\. 



3 Channel Capacity and the Pre-Log 

Channel capacity is defined as the supremum of all achievable rates. (We refer to [H Ch. 8] for 
a definition of an achievable rate and for a more detailed discussion of channel capacity.) It was 
shown (e.g., [U Thm. 2]) that the capacity of our channel (fl]) under a peak-power constraint A 2 
on the inputs is given by 

C(SNR) = lim isup/(X";r™); (3) 



A 2 



where SNR is defined as 

SNR^l^; (4) 

A^j denotes the sequence A m , . . . , A n ; and where the maximization is over all joint distributions 
on X\ , . . . , X n satisfying with probability one 

|X fc | 2 <A 2 , k = l,...,n. (5) 

The capacity pre-log is defined as [T] 

SNR^oo log SNR V ^ 

For Gaussian fading, i.e., when {H k — d, k € Z} is a circularly-symmetric, complex Gaussian 
process, the pre-log IIq is given by the Lebesgue measure of the set of harmonics where the 
derivative of the spectral distribution function is zero, i.e., 

n G = M ({A:F'(A)=0}), (7) 

where /i(-) denotes the Lebesgue measure on the interval [—1/2,1/2]; see [T], [2]. (Here the 
subscript "G" stands for "Gaussian".) 

This result indicates that if the fading process is Gaussian and satisfies 

H({\:F'(\)=0})>0, 

then the corresponding channel capacity grows logarithmically in the SNR. Note that otherwise 
the capacity can increase with the SNR in various ways. For instance, in [6] fading channels are 
studied that result in a capacity which increases double-logarithmically with the SNR, and in 
PQ spectral distribution functions are presented for which capacity grows as a fractional power 
of the logarithm of the SNR. 
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4 Main Result 

We show that, among all stationary & ergodic fading processes of a given spectral distribution 
function and whose law has no mass point at zero, the Gaussian process gives rise to the smallest 
pre-log. This is made precise in the following theorem. 

Theorem 1. Consider a mean-d, unit-variance, stationary & ergodic jading process {H k , k G Z} 
whose spectral distribution function is given by F(-) and whose law satisfies 

Pr[H k = 0] = 0, fcGZ. 

Then the corresponding capacity pre-log II is lower bounded by 

n> M ({A:F'(A)=0}). (8) 

Proof. See Section O □ 

The assumption that the law of the fading process has no mass point at zero is essential in 
the following sense. 

Note 1. There exists a mean-d, unit-variance, stationary & ergodic fading process {H k , teZ} 
of some spectral distribution function F(-) such that 

II< M ({A:F'(A)=0}). (9) 

By Theorem [H this process must satisfy 

Pv[H k = 0] > 0, k G Z. 

Proof. See Section O □ 

Note 2. The inequality in @ can be strict. For example, consider the phase-noise channel 
with memoryless phase noise. This channel can be viewed as a fading channel where the fading 
process {H k , k G Z} is given by 

H k = e' e \ fcGZ, 

and where {Ok, k G Z} is IID with Q k being uniformly distributed over [— tt, it). This process 
gives rise to a pre-log IT = 1/2, whereas the Gaussian fading of equal spectral distribution function 
yields Hq = 0. 

Proof. For a derivation of the capacity pre-log of the phase-noise channel see Section 15.31 □ 



5 Proofs 

This section provides the proofs of our main results. For a proof of Theorem [1] see Section I5~T1 
for a proof of Note Q] see Section 15 . 21 and for a proof of Note [2] see Section 15.31 

5.1 Proof of Theorem [I] 

To prove Theorem [TJ we derive in Section 15.1.11 a lower bound on the capacity, and proceed in 
Section 15.1.21 to analyze its asymptotic growth as the SNR tends to infinity. 

5.1.1 Capacity Lower Bound 

To derive a lower bound on the capacity we consider inputs {X k , fcGZ} that are IID, zero- mean, 
circularly-symmetric, and for which \X k \ 2 is uniformly distributed over the interval [0, A 2 ] . Our 
derivation is based on the lower bound 

-/(*?; Yi) > -I{X?\ Yr\H[ l ) - -I(H{ 1 - Y?\X?), (10) 
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which follows from the chain rule 



I(X?; Y") = I(X?, H™;Yl l ) ~ I(H?; Y"\X") 

= I(H{ 1 ; T") + I(X?; Y?\H2 ) - I(H?;Y?\X?) (11) 

and the nonnegativity of mutual information. 

We first study the first term on the right-hand side (RHS) of (fTUl) . Making use of the 
stationarity of the channel and of the fact that the inputs are IID we have 

i/(x 1 " ; y 1 "|£f») = /(x i; y 1 | J E 1 ). (12) 

n 

We lower bound the RHS of (fT2")l as follows. For any fixed T > 

I(Xi;Yi\Hi) = h{H x X x + Z l \H 1 ) - h(Z 1 ) 

= [ h(H 1 X 1 + Z 1 \H 1 =h 1 )dP Hl (h 1 ) 
J\hi\>r 

+ f h{H 1 X 1 + Z 1 \H 1 = h 1 )dP Hl {hi)-h{Z 1 ) 
J\hi\<r 

> I h(H 1 X 1 + Z 1 \H 1 =h 1 )dP Hl (h 1 )+-Pr[\H 1 \<?]h(Z 1 )-h(Z 1 ) 
J\hi\>r 

> f (log\hi\ 2 + h(Xi)) dP Hl (h 1 )+ Pt[\Hi\ <T]h{Z 1 )-h{Z 1 ) 
J\hi\>~t 

> Pr[|#i| > T] (logT 2 + h(Xij) + Pr[|ffi| < T] h(Z\) - h{Z x ) 

= Pr[|#! | > T] (log T 2 + log tt + h(\X x | 2 )) + Prtl^i | < T] h{Z x ) - h(Z x ) 
= Pr[|ffi| > T]logA 2 + Pr[|Hi| > T] log (ttT 2 ) + Pr[\Hi\ < T]h{Z 1 ) - h{Z x ) 
= Pr[|ff x | > T]logA 2 + Pr[ji?i| > T] log (ttT 2 ) + (Pr[|ffi| < T] - l)log(^ea 2 ) 
= Pr[|i?i | > T] log SNR - Pr[|#i | > T] (l - log T 2 ) , (13) 

where Ptixi ) denotes the distribution function of the fading H\. Here the third step follows by 
conditioning the entropy in the second integral on X\, the fourth step follows by conditioning 
the entropy in the first integral on Z\ and by the behavior of differential entropy under scaling 
[U Thm. 9.6.4]; the fifth step follows because over the range of integration \hi\ > T we have 
log \h\\ 2 > logT 2 ; the sixth step follows because X\ is circularly-symmetric [5J Lemma 6.16]; the 
seventh step follows by computing the entropy of a random variable that is uniformly distributed 
over the interval [0, A 2 ] ; the eighth step follows by evaluating the entropy of a zero-mean, 
variance-er 2 , circularly-symmetric, complex Gaussian random variable h(Zk) = log(7re<7 2 ); and 
the last step follows from Pr[|i2i| > T] = 1 - Pr[|i2i| < T]. 

We next turn to the second term on the RHS of (|10p . In order to upper bound it we proceed 
along the lines of [7], but for non-Gaussian fading. Let Y, H, and Z be the random vectors 
(Yi, . . . , Y n ) J , (Hi, . . . , H n y , and (Z x , . . . , Z n ) J (where A T denotes the transpose of A), and let 
X be a diagonal matrix with diagonal entries xi, . . . , x n . It follows from |T]) that 

Y = XH + Z. (14) 

The conditional covariance matrix of Y, conditional on x\, . . . , x n , is given by 

(Y - E[Y]) (Y - E[Y]) f | X? = xf\ = XK HH Xt + a 2 l„, (15) 

where l„ is the n x n identity matrix, denotes Hermitian conjugation, and 

K HH ^E[(H-E[H])(H-E[H])t] . (16) 
Let det A denote the determinant of the matrix A. Using the entropy maximizing property of 
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circularly-symmetric Gaussian vectors [H Thm. 9.6.5], we have 



~I(H?;Y?\X?) 
n 



n n 



n 

1 ^ 



1 



logdet M„ 
logdet I \ n 



1 

1 



Khh^ 



< - logdet l„ + ^Khh 



logdet (l„ + SNR K HH ) 



n 

- Y>g(l + SNRA fe ), 



fc=l 



(17) 



where X is a random diagonal matrix with diagonal entries X%, . . . , X n , and where Ai, 
denote the eigenvalues of Khh- Here the third step follows from the identity det(l„ + AB) = 
det(l„ + BA); the fourth step follows from ([5]) which implies that A 2 I„ — X^X is positive semidef- 
inite with probability one; the fifth step follows from the definition of SNR (0]); and the last step 
follows because the determinant of a matrix is given by the product of its eigenvalues. 

To evaluate the RHS of (jTTJ) in the limit as n tends to infinity, we apply Szego's Theorem 
on the asymptotic behavior of the eigenvalues of Hermitian Toeplitz matrices [5] (see also 
Thm. 2.7.13]). We obtain 

1 1 " 

lim -l{m\Y?\X?)< lim - V log(l + SNR A fe ) 

n^oo fi n— >oo fi — ^ 



k=l 



1/2 



log(l + SNRF'(A))dA. 

1/2 

Combining ([TO]). t[T2|>. (fT3|) . and (IS]) yields the final lower bound 



(18) 



C(SNR) > Pr[|ffi | > T] log SNR - Pr[|ifi | > T] (1 - log T 2 ) 

,1/2 

log (1 + SNRF'(A)) dA, SNR > 0, (19) 

-1/2 

which holds for any fixed T > 0. Note that this lower bound applies to all mean-d, unit- variance, 
stationary & ergodic fading processes {Hk, fceZ] with spectral distribution function F(-). 



5.1.2 Asymptotic Analysis 

In the following we prove by computing the limiting ratio of the lower bound (fT9|) to log SNR 
as SNR tends to infinity. 
We first show that 



lim 

SNR^oo 



l / 2 log(l + SNRF'(A)) 



1/2 



log SNR 



dA = /i({A: F'{\) > 0}) 



(20) 



To this end, we divide the integral into three parts, depending on whether A takes part in the 
set Si, <S>2, or S3, where 



51 4 {AG [-1/2, 1/2] :*"(*)=()} 

5 2 = {Ae[-l/2,l/2]:F'(A)>l} 

5 3 = {A e [-1/2, 1/2] : < F'(X) < 1}. 

For A € Si the integrand is zero and hence 

log(l + SNRF'(A)) 



lim 

SNR^oo 



Si 



log SNR 



dA = 0. 



(21) 
(22) 
(23) 



(24) 
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For A <E 6>2, i- e -; when F'(X) > 1, we note that for sufficiently large SNR the function 

log(l + SNRF'(A)) 



SNR i 



log SNR 



is monotonically decreasing in SNR. Therefore, applying the Monotone Convergence Theorem 
[TGI Thm. 1.26], we have 

lim io g (i + SNR F( A)) dA _r Bm Mi + amfw) 



SNR^oo log SNR 7,S 2 SNR^oo log SNR 

= M(^ 2 ) 



^({A:F'(A)>1}). (25) 



For A S S3, i.e., when < F'(A) < 1, we have 



where the last step follows because, for sufficiently large SNR, the function 

log(l + SNR) 



SNR i 



log SNR 



is monotonically decreasing in SNR. Since log(l + e) is integrable over S3, we can apply the 
Dominated Convergence Theorem jTOJ Thm. 1.34] to obtain 

lim / Ml + SNRnA)) dA = f Hm lo g (l + SNR^(A)) dA 



SNR^ooJ 5;j log SNR Js 3 SNR^oo log SNR 

= M(5 3 ) 

= M ({A:0<F'(A)<1}). (27) 

Adding HMD, (ESD, and ([57} yields (f^U|) . 

To continue with the asymptotic analysis of (fH?)) we note that by ([50)) 

n- hm ^» 

SNR^oo log SNR 

> Pr[| J ff 1 |>T]-//({A:F'(A)>0}) 

= /i({A:^'(A)=Q})- Pr[|H x |<T] (28) 
for any T > 0. If the law of the fading process has no mass point at zero, then 

lim Pr[|#i| < T] = 0, (29) 

and ([5]) therefore follows from l[2"5[) by letting T tend to zero from above. 

5.2 Proof of Note Q] 

We prove Note [T] by demonstrating that there exists a stationary & ergodic fading process of 
some spectral distribution function F(-) for which 

U<^({X:F'(X)=0}). 

By Theorem [TJ the law of such a process must have a mass point at zero, i.e., 

Pr[H k = 0] > 0, k G Z. 

To this end, we first show that the capacity pre-log is upper bounded by 

II< Pr[\Hi\ >0]. (30) 



G 



Indeed, the capacity C(SNR) does not decrease when the receiver additionally knows the real- 
ization of {H k , k G Z}, and when the inputs have to satisfy an average-power constraint rather 
than a peak-power constraint, i.e., 

C(SNR)< lim -supI(X?;Y?\H?), (31) 

n— >oc Tl 

where the maximization is over all input distributions on X\ , . . . , X n satisfying the average-power 
constraint 

E[|X fc 

n ^ 

(This follows because the availability of additional information cannot decrease the capacity, 
and because any distribution on the inputs satisfying the peak-power constraint ([5]) satisfies also 
([32]) .) It is well known that the expression on the RHS of ([31]) is equal to 



i^i[W]. SNR . (32) 



lim - sup /(AT; Y"\m i ) = E [log(l + \H X \ 2 SNR)1 (33) 

n— >oo TL 

(e.g., eq. (3.3.10)]), which can be further upper bounded by 

E[log(l + |H 1 | 2 SNR)] = Pr[\Hi\ > 0] E [log(l + \Hi\ 2 SNR) | > 0] 
< Pr[|ffi| >0] log(l + E [ I^Txl 2 | >0] SNR) 

= P ^l>0]log(l + ^_). (34) 

Here the first step follows by writing the expectation as 

E[log(l + |#i| 2 SNR)] = Pr[ji?i| = 0] E [log(l + \Hi\ 2 SNR) | = 0] 

+ Pr[|^ | > 0] E [log(l + \H X \ 2 SNR) | |£Ti| > 0] , 

and by noting then that E[log(l + |iii| 2 SNR) | = 0] =0; the second step follows from 
Jensen's inequality; and the last step follows because E[|iii| 2 ] = 1, which implies 

E[ig 1 l 2 l]gt]>0]= pr[| ^ |>0] . 
Dividing the RHS of (|34| by log SNR, and computing the limit as SNR tends to infinity yields 

dSOD- 

In view of ([3H]) , it suffices to demonstrate that there exists a fading process of some spectral 
distribution function F(-) that satisfies 

Pr[\H 1 \>0]< f i({\:F'(\)=0}). (35) 

A first attempt of defining such a process (which, alas, does not work) is 

!. . . , 0, 0, 0, 0, . . . with probability 6 

(36) 
. . . , -B-i, So, B\, £?2, . . . with probability 1 — 8, 

where {Bk, k £ Z} is a zero-mean, circularly-symmetric, stationary & ergodic, complex Gaussian 
process of variance 1/(1 — S) and of spectral distribution function G(-); and where S and G(-) 
are chosen so that 

1-5 < n{{\:G'(\) =0}). (37) 
This process satisfies (f35]) because Pr[|_ffi| > 0] = 1 — S, and because 

E[(H k+m - d)(H k - d)*} = (l-S) E[B k+m B* k ] , (38) 

which implies that F(X) = (1 — d)G(X) almost everywhere, so 

M ({A:F'(A) = 0}) = m({A:G'(A) = 0}). (39) 
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Alas, the above fading process is stationary but not ergodic. 

In the following, we exhibit a fading process that is stationary & ergodic and satisfies ([55]) . 

Let 

!. . . , 0, 1, 0, 1, . . . with probability ^ 
" (40) 
...,1,0,1,0,... with probability |, 

and let {B k , k G Z} be a zero-mean, variance-2, circularly-symmetric, stationary & ergodic, 
complex Gaussian process of spectral distribution function G(-). Furthermore let {A kl k G Z} 
and {-Bfe, fc G Z} be independent of each other. We shall consider fading processes of the form 

H k = A k -B k , fcGZ. (41) 

Note that {H kl k G Z} is of zero mean, and its law has a mass point at zero 

Pr[|fl*| > 0] = Pv[A k = 1] = ~, fcGZ. (42) 

We first argue that {H k , k G Z} is stationary & ergodic. Indeed, {A^, fc G Z} is stationary 
& ergodic. And since a Gaussian process is ergodic if, and only if, it is weakly-mixing (see, e.g., 
[I2l Sec. II]), we have that {B k , k G Z} is stationary & weakly-mixing. (See [13l Sec. 2.6] for a 
definition of weakly- mixing stochastic processes.) It thus follows from [14, Prop. 1.6] that the 
process {(A k ,B k ), k G Z} is jointly stationary & ergodic, which implies that {H k , k G Z} = 
{Afc • Bfc, A: G Z} is stationary & ergodic. 

We next demonstrate that G(-) can be chosen so that {H k , fcGZ} satisfies (pJ5"j) . We choose 

— , if |A| < W 
G'(A) = <( W " (43) 

0, otherwise 

for some Wg (0, 1/8), which corresponds to the autocovariance function 

E[B k+m B£\ = 2 sinc(2Wm), m G Z. 

Here sinc(-) denotes the sinc-function, i.e., sinc(x) = sin(7ra;)/(7rx) for \x\ > and sinc(0) = 1. 
Using that 

E[^4fe +m A^.] = - I {m is even} , m G Z 

(where I {statement} is 1 if the statement is true, and otherwise), we have for the autocovari- 
ance function of {H kl fcGZ} 

E[H k + m H k ] = £[A k+m B k+m A* k Bl] 

— ^-[A k + m A* k ] E[B k+m Bl] 

= l{m is even} • sinc(2Wr7i), m G Z, (44) 

and the corresponding spectrum is given by 

f ^, if|A|<Wori-W<|A|<i 
F'(X) = \ ' ' (45) 

I 0, otherwise. 

Evaluating the Lebesgue measure of the set of harmonics where F'(X) = 0, we have 

/i ({A: F'(X) = 0}) = 1 — 4W, (46) 

and it follows from (l42l) that 



Pv[\H k \ > 0] = i < M ({A: F'(A) = 0}) , for W< i. 

Thus there exist stationary & ergodic fading processes whose law has a mass point at zero and 
that give rise to a capacity pre-log that is strictly smaller than the pre-log of a Gaussian fading 
channel of equal spectral distribution function. 



8 



5.3 Proof of Note H 

To prove NoteJH we first notice that, since the phase noise is memoryless, the derivative of the 
spectral distribution function is 

F'(X)=1, -\<^<\- 

Hence the capacity pre-log of the Gaussian fading channel of spectral distribution function F(-) 
equals 

n G=M ({A:F'(A) = 0}) = 0. (47) 

It thus remains to show that the pre-log of the phase-noise channel with memoryless phase 
noise is equal to 

n = i. (48) 

In |15j it was shown that at high SNR the capacity of the phase-noise channel under an average- 
power constraint on the inputs is given by 

C Avg (SNR) = \ logfl + + o(l), (49) 

where o(l) tends to zero as SNR tends to zero. (The subscript "Avg" indicates that the inputs 
satisfy an average-power constraint and not a peak-power constraint.) Since any distribution on 
the inputs satisfying the peak-power constraint © satisfies also the average-power constraint, 
it follows that C(SNR) < C Avg (SNR) and hence 

n < \ (50) 

To prove (|48|) it thus suffices to show that II > i. To this end, we first note that, since the 
phase noise is memoryless, we have 

(7(SNR) = sup/(X 1 ;Y 1 ), (51) 
where the maximization is over all distributions on X\ satisfying with probability one 

|*i | < A. 

We derive a lower bound on C(SNR) by evaluating the RHS of (fSTj) for X\ being a zero- 
mean, circularly-symmetric, complex random variable with \X\\ 2 uniformly distributed over the 
interval [0, A 2 ] . We have 

IiXx-Yx) >/(X i; |li| 2 ) 

= h(\Y 1 \ 2 )-h(\Y 1 \ 2 \X 1 ) 

>h{\X 1 \*)-h{\Y l \ 2 \X l ), (52) 

where the first step follows from the data processing inequality [H Thm. 2.8.1]; and the last step 
follows by the circular symmetry of X\ p~5l p. 3, after eq. (20)]. 

Computing the differential entropy of a uniformly distributed random variable, the first term 
on the RHS of becomes 

h(\X x f) =logA 2 . (53) 

As to the second term, we note that, for a given X\ — Xi, the random variable 2/cr 2 |Yi| 2 has 
a noncentral chi-square distribution with noncentrality parameter 2/cr 2 |a;i| 2 and two degrees of 
freedom. Its differential entropy can be upper bounded by [151 eq. (8)] 



^(l^il 2 | A X ) < IE 



!<>}■( 47re(2 + 2-lpr 1 | a 



lOg^T 



< ilogU^ e (2 + 2A A 2 ) j -log^, (54) 
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where the last step follows because < A with probability one. Combining (f53"| and (|5^|) 
with ([52")) yields thus 

I{X X )Yx) > ilogSNR + o(logSNR), (55) 

where 

lim Q j l0S c SNR) =Q- 

SNR^oo log SNR 

We finally obtain the lower bound 

n>i 

- 2 

upon dividing the RHS of ([55]) by log SNR and letting then SNR tend to infinity. 



6 Extension to MISO Fading Channels 



Theorem[T]can be extended to multiple-input single-output (MISO) fading channels with mem- 
ory, when the fading processes corresponding to the different transmit antennas are independent. 
For such channels, the channel output Yk G C at time fcgZ corresponding to the channel input 
x/j G C" T (where Ht stands for the number of antennas at the transmitter) is given by 



Zk 



Y k = Hjx* 

where EL = (h£\ . . . , H ( k nT) y, and where the processes 



k G Z, 



(56) 



{H ( k L> , keZ}, {H[ z \ kez}..., {H { k nT> , k g z} 

are jointly stationary & ergodic and independent. We assume that for each t = 1, . . . ,nx the 
process {Hj?\ k e Z} is of mean dt, of unit variance, and of spectral distribution function F t (-). 
We further assume that 



Pr 



H, 



(i) 







Pr 



H 



(2) 







Pr 



H 



(n T ) 



Q 



o. fcez. 



(57) 



The additive noise {Zk, k e Z} is defined as in Section 

The capacity of this channel is given by ([3]), but with X™ replaced by X™, and with the 
peak-power constraint ^ altered accordingly: 



||Xfe|| < A with probability one, 
where llall denotes the Euclidean norm of the vector a, i.e., 



k G Z, 



\ 1=1 



(oi, . . . ,a L ) T . 



Let 5 denote the pre- log of MISO fading channels. Following ©, we define H as 

-a C(SNR) 

" SNR^oo log SNR' 



(58) 



(59) 



(60) 



For Gaussian fading, i.e., when {H^ — d t , k G Z}, 1 < t < nx are circularly-symmetric, complex 
Gaussian processes, the pre-log was shown to be given by [El Cor. 13] 



S G = max M ({A:i^(A)=0}). 

1 < £ < nT 



(61) 



(A proof of this result can be found in [TTJ, Sec. 7.2.2].) 

Proving that the capacity pre-log 3 of MISO fading channels is lower bounded by the pre- 
log of the MISO Gaussian fading channel of equal spectral distribution functions — namely 
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Fi (•),... ,F nT (-) — is straightforward. Let H t , 1 < t < n T denote the capacity pre-log of a 
single- antenna fading channel with fading process {H^\ k g Z}, and let 

t* = arg max ILj. 

l<t<n T 

By signaling only from antenna t+ while keeping the others silent, we can achieve the pre-log 

n tt , so 

3 > max n f . (62) 

l<t<n T 

Theorem Q] yields then 

U t >fi({X:F^X)=0}), l<t<n T , (63) 
which together with (|62[) proves the claim 

S> max /i({A:F t '(A) = 0}). (64) 

l<t<riT 



7 Summary and Discussion 

We showed that, among all stationary & ergodic fading processes of a given spectral distribution 
function and whose law has no mass point at zero, the Gaussian process gives rise to the smallest 
capacity pre-log. We further showed that if the fading law is allowed to have a mass point at 
zero, then the above statement is not necessarily true anymore. Roughly speaking, we can say 
that for a large class of fading processes the Gaussian process is the worst. This demonstrates 
the robustness of the Gaussian assumption in the analysis of fading channels at high SNR. 

To give an intuition why Gaussian processes give rise to the smallest pre-log, we recall that 
for Gaussian fading [TJ eqs. (33) & (47)] 

C(SNR) = log 2 , 1 l /qMR . + o(logSNR), 

where e^ re(i (S) denotes the mean-square error in predicting the present fading Hq from a variance- 
(5 noisy observation of its past + W-\,H-i + W-2, ■ ■ ■ (with {Wt, k G Z} being a sequence 
of IID, zero-mean, variance-<5, circularly-symmetric, complex Gaussian random variables). Thus 
for Gaussian fading the capacity pre-log is determined by £p red (l/SNR), and it is plausible that 
also the pre-log of non-Gaussian fading channels is connected with the ability of predicting 
the present fading from a noisy observation of its past. Since, among all stationary & ergodic 
processes of a given spectral distribution function, the Gaussian process is hardest to predict, it 
is therefore plausible that the Gaussian process gives rise to the smallest pre-log. 
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